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Abstract — This letter proposes a novel method for accelerating 
iterative detection for spatially coupled (SC) systems. An SC 
system is constructed by one-dimensional coupling of many 
subsystems, which are classified into training and propagation 
parts. An irregular structure is introduced into the subsystems in 
the training part so that information in that part can be detected 
successfully. The obtained reliable information may spread over 
the whole system via the subsystems in the propagation part. In 
order to allow the subsystems in the training part to collabo- 
rate, shortcuts between them are created to accelerate iterative 
detection for that part. As an example of SC systems, SC code- 
division multiple-access (CDMA) systems are considered. Density 
Evolution for SC CDMA systems shows that the proposed method 
can provide a significant reduction in the number of iterations 
for highly loaded systems, compared to conventional methods. 

Index Terms — spatial coupling, code-division multiple-access, 
small world, belief propagation, density evolution. 

I. Introduction 

SPATIAL coupling has been proved to improve the 
belief-propagation (BP) performance of conventional low- 
density parity-check (LDPC) codes up to the correspond- 
ing maximum-a-posteriori (MAP) performance |T). This phe- 
nomenon has been observed in many other problems, such as 
code-division multiple-access (CDMA) systems |2)-|I], com- 
pressed sensing 0-0 , and models in statistical physics |8|. 
See [3], |9| for a theoretical treatment of general systems. 

A spatially coupled (SC) system is constructed as a one- 
dimensional chain of L large subsystems. A slightly irregular 
structure, which results in a rate loss, is introduced at both ends 
of the chain so that information at both ends can be detected 
successfully. BP detection consists of two stages: training and 
propagation stages. In the training stage, information at both 
ends is first detected by utilizing the irregularity. We refer to 
positions at which irregularity is imposed as training positions. 
In the propagation stage, on the other hand, the reliable 
information at both ends propagates toward the center of the 
chain. Order preserved in each large subsystem enables error- 
free propagation. Since the rate loss due to the irregularity 
at both ends vanishes as L 4 oo, the best performance is 
achievecQ in that limit. This implies that a long chain should 
be used to approach the MAP performance. 
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When a long chain is used, many iterations are required 
for propagating reliable information over the whole chain. The 
purpose of this letter is to propose a novel method of coupling 
for accelerating the convergence of iterations. A remarkable 
effort oriented in the same direction was made by Truhachev 
et al.: They proposed to divide a long chain into several 
short chains and to re-connect the divided chains in a ladder- 
like [ 1 1 or loop-like ifTTIl structure. The connecting positions, 
along with the end positions, correspond to the training posi- 
tions. Reliable information at the connecting positions spreads 
over the chains simultaneously. As a result, the propagation 
of reliable information over the whole system is accelerated, 
compared to one long chain. 

In this letter, we shall accelerate iterative detection in the 
training stage by creating shortcuts between distant training 
positions. Reliable information at each training position prop- 
agates to the other training positions through the shortcuts, 
whereas there is no collaboration between distant training 
positions for conventional SC systems. This collaborative 
training accelerates iterative detection in the training stage. 

This letter is organized as follows: After summarizing the 
notation and terminology used in this letter, in Section IIT1 we 
focus on SC CDMA systems as an example of SC systems, 
and explain how to connect training positions. Section [Til] 
presents the density-evolution (DE) analysis of BP detection. 
In Section ITVl comparisons between collaborative training and 
non-collaborative training are made in terms of the number of 
iterations. Section [VI concludes this letter. 

For integers I and L, (I) j, is equal to 1 + iL for an integer i 
such that < I + iL < L — 1. The set of consecutive 
integers {i,i + (i < j) is written as [i : j]. The 

vector 1„ denotes the rt-dimensional vector whose elements 
are all one. The Q-function Q(-) is the upper tail probability 
of the standard real Gaussian distribution. The real Gaussian 
distribution with mean m and covariance £ is written as 
Af(m, S). In a graph, the degree of a node is defined as the 
number of edges connected to the node. The distance between 
two nodes is the number of edges in the shortest path that 
connects the two nodes. 

II. System Model 

A. Spatially Coupled CDMA Systems 

We consider synchronous K-user SC CDMA systems over 
the real additive white Gaussian noise (AWGN) channel as an 
example of SC systems. Note that it is possible to apply the 
idea in this letter to the other SC systems. For simplicity in 
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presentation, dense spreading sequences are used, whereas the 
dense limit of sparse spreading sequences should be taken for 
rigorous DE 02]], (13). See for the details. 

Transmission over L symbol periods is considered. Let Ni 
and N = L^ 1 J2^Tq Ni denote the spreading factor in symbol 
period I and the average spreading factor, respectively. The 
received vector y = (yj, . . . , yJ_ 1 ) T £ M. LN is given by 



w, 



y = Sx 

In (Q), x = (scj, . . . , ccJ_ 1 ) T £ {1,-1} LK denotes the 
data symbol vector with binary phase shift keying (BPSK), 
in which x m £ {1, — 1} A consists of the mth data symbols of 
all users. Furthermore, S represents the LN x LK spreading 
matrix, constructed as 



w ~ Af(0,a 2 I). 



(1) 



&0,0'S'0,i 



b L - 



1,0>-»L-1,0 



bo L-lSo L-l 



In (O, {Si. m } are independent Ni x K matrices that have 
independent entries taking ±l/^/N~i with probability 1/2. 
Furthermore, b^ m £ R is the (I, m)-element of the base matrix 
B £ R LxL , which characterizes spatial coupling. Imposing 
Tld=Q m = 1 f° r a ^ m normalizes the average power used 
for transmitting each data symbol. 

The set C = [0 : L — 1] of all symbol periods is decomposed 
into the training phase T C C and the propagation phase 
V C C, which are disjoint subsets of C. Let N tv and N denote 
the spreading factors in the training and propagation phases, 
respectively. Large iVt r is assumed to obtain reliable estimates 
of the data symbols transmitted in the training phase. The 
average load a is defined as 



LK 



a = 



i - 



(3) 



\r\N tT + \r\N 

with t = |T|. In Q, a tl = K/N tl and a = K/N denote 
the loads in the training and propagation phases, respectively. 
Under the BPSK assumption the average load (f3]i is equal to 
the average sum rate, and tends toward a as L — > oo and 
t/L -> 0. 

B. Spatial Coupling & Collaborative Training 

In one-dimensional or circular coupling with coupling 
width W, a circulant matrix is used as the base matrix B: 



h. 



with (6 ,...,&L-i) 



1 



y/2W + 1 



\l-m)i 



(4) 



matrix is written as B 



(reg) 
L.W 



This regular base 
In order to present collaborative 



training, we shall introduce a graph representation of the base 
matrix B. 

The L x L base matrix B can be represented by a bipartite 
graph that consists of L factor nodes and L variable nodes. 
The factor nodes shown by squares correspond to the row 
indices of B, whereas the variable nodes represented by circles 
are associated with the column indices. If the element bi, m is 
non-zero, there is an edge between factor node I and variable 
node m. Otherwise, there is no edge between the two nodes. 




Fig. 1. Regular bipartite graph with L = 32 and W = 2. 



(2) In this letter, we refer to the bipartite graph that represents 
the regular base matrix B^°$l as the regular bipartite graph 



(rcg) 
L,W 



See Fig. Q]for the regular bipartite graph B 



(rcg) 
32,2 ■ 



B 

As an ensemble of collaborative training, we use an ensem- 
ble based on a small-world (SW) network fl4l . which is a 
model for elucidating the so-called SW phenomenon such as 
"six degrees of separation." SW networks are highly clustered, 
as regular networks are, and have short distances between any 
two nodes, as random networks do. These properties of SW 
networks are suitable for accelerating iterative detection in the 
training stage. In this letter, this SW-network-based ensemble 
is referred to as "SW ensemble." 

Let C^° s '(m) C C denote the set of node m and nodes 
with distance 2 from node to in a regular bipartite graph with 
coupling width W. For example, (0) for variable nodes 
is shown by the gray nodes in Fig. Q] The SW ensemble is 
obtained by modifying a regular bipartite graph at positions 
included in equally spaced c clusters {Cfy • (*-^/ c )}i=o- 

1) Let i = and generate the regular bipartite graph B^fy . 

2) Repeat the following for all variable nodes to £ 

(iL/c) in the ith cluster: With probability p, re- 



connect each edge that is connected to variable node m 

to a factor node / £ Uj^iC^^^L/c) in the other 

clusters uniformly and randomly. 
3) If i = c— 1, terminate the algorithm. Otherwise, go back 

to Step 2) after i := i + 1. 
In bipartite graphs obtained by the algorithm above, dif- 
ferent factor nodes may have different degrees, whereas all 
variable nodes have degree 2W + 1, We propose a heuristic 
algorithm for determining the training phase T with size 
r = |T|. The proposed algorithm assigns factor nodes with 
largest degrees to the training phase. 

1) Let f = t, T = 0, and d = d max , with d max denoting 
the maximum degree of the factor nodes. 

2) If the number of factor nodes with degree d is greater 
than or equal to f , pick up f factor nodes from all 
factor nodes with degree d uniformly and randomly, 
add the corresponding indices into T, and terminate the 
algorithm. Otherwise, go to the next step. 

3) Add the indices Cd that represent all factor nodes with 
degree d into T, and go back to Step 2) after f := 
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f — \Cd\ and d := d— 1. 
We refer to the ensembl^ that consists of all instances 
obtained by the two algorithms above as the (L,W,p,c,r)- 
SW ensemble. See Fig. [2] for an instance obtained from the 
SW ensemble. 

Remark 1. The degrees of the factor nodes provide an 
impact on the complexity of iterative detection algorithms. 
One method for making a fair comparison between regular 
and irregular bipartite graphs would be to consider sparsely 
spread CDMA systems in which, for factor nodes with a large 
degree, small row weights are assigned to the correspond- 
ing subsystems. However, this influence on the performance 
vanishes in the dense limit. This argument implies that com- 
parisons presented in this letter are not necessarily unfair in 
terms of the complexity. 

III. Density Evolution 

We consider an iterative detection algorithm based on BP 
with the Gaussian approximation [15]. See [3] for the details. 
In order to evaluate the bit error rate (BER) via DE, we take 
the large-system limit in which K and {N{\ tend to infinity 
with the ratios ai = K/Ni fixed for all I. 

Theorem 1. In the large-system limit, the BER of each element 
of the mth data symbol vector x m in iteration i is given 
by Q(\/sir m (i)), in which {sir m (i)} are determined via the 
coupled DE equations, 



six m (i) = 



1=0 



(5) 



L-l 



a?(i) = o 2 + m &£ m MMSE(sir m (i - 1)), (6) 

m=0 

with sir m (0) = for all m. In (0, the function MMSE(.t) 
denotes the minimum mean-squared error (MMSE) of the 
BP SK-input AWGN channel with signal-to-noise ratio (SNR) x. 

2 The ensemble with L = 64, W = 2, and c = 2 contains all bipartite 
graphs obtained by re-connection of the edges in Fig. f2] However, their 
occurrence probabilities are non-uniform and determined on the basis of 
the probability p, whereas instances contained in conventional ensembles are 
assumed to occur uniformly and randomly. 



Fig. 3. BER versus m/L for the bipartite graph of coupling obtained 
from the (64, 2, 0.1, 2, 14)-SW ensemble, shown in Fig. [2] 1/a 2 = 10 dB, 
a tr = 1.45, and a = 1.98. 



Proof The proof is a generalization of the results in [3 1 
and is therefore omitted. ■ 
When a bipartite graph of coupling and the load a tI in the 
training phase are given, the DE equations (0 and §6^ have a 
unique fixed-point for small SNR 1/cr 2 or small load a in the 
propagation phase. On the other hand, there may be multiple 
fixed-points for large SNR and large load. The BP threshold 
for fixed SNR is defined as the maximum load such that the 
DE equations have a unique fixed-point. 

Definition 1. When a% r , a bipartite graph of coupling, and 
SNR are fixed, the BP threshold is defined as the supremum 
of a c such that the DE equations @ and ([6]) have a unique 
fixed-point for all a < a c . 

The operational meaning of the BP threshold is that, when a 
is smaller than the BP threshold, the BP-based iterative algo- 
rithm can eliminate multiple-access interference (MAI), and 
achieve performance close to the single-user bound. On the 
other hand, the system is MAI-limited when a is larger than 
the BP threshold. When 1/cr 2 = 10 dB, the BP threshold a B p 
for the uncoupled CDMA system is given by «bp ~ 1.73078. 
As L and W tend to infinity with W/L —> 0, on the other 
hand, the corresponding BP threshold agp S ' for the circularly 
coupled CDMA system tends toward the optimal threshold 
«map « 1.98267 0. 



IV. Numerical Comparisons 

We shall make comparisons between regular coupling and 
the SW ensemble. Bipartite graphs obtained from the SW 
ensemble have distinctly different performance instance by 
instance for small-sized graphs, so that we used good instances 
obtained by numerical sampling from the SW ensemble. 

Figure [3] shows the evolution of the BERs for the bipartite 
graph of coupling shown in Fig. [2] We find that the data 
symbols in the two clusters C^' cg) (0) and C^ rcg) (32) are 
detected first. Their BERs are not constant, because of the 
irregularity of the bipartite graph in the clusters. Then, the 



IEEE COMMUNICATIONS LETTERS, VOL. , NO. , 2012 



& 10 




200 250 300 
Number of iterations 



Fig. 4. Average BER versus the number of iterations for W = 2, 1/u 2 = 
10 dB, and «tr = 1.45. For the regular coupling with L = 64 (resp. L = 
128), 14 (resp. 28) symbol periods T = {[61 : 63], [0:3], [29 : 35]} (resp. 
T = {[125 : 127], [0:3], [29 : 35], [61 : 67], [93 : 99]}) are assigned to the 
training phase. The bipartite graph of coupling shown in Fig. [2] was used for 
the (64, 2,0.1, 2, 14)-SW ensemble. 



reliable information propagates toward the middle positions 
between the two clusters. Eventually, the BERs at all positions 
tend to a small level of 10 -3 . 

Remark 2. For convenience in presentation, we have assigned 
symbol periods in C^ ^ (0) U (32) to the training phase. 
The performance is of course invariant under the permutations 
of rows in the base matrix. This invariance may allow us to use 
a low-complexity implementation of iterative detection, such 
as sliding window decoding [16]. However, such a study is 
beyond the scope of this letter. 

Figure |4] presents comparisons between the regular coupling 
and the SW ensembles. For a = 1.9 much smaller than 
the optimal threshold «map ~ 1.98267, three methods of 
coupling are indistinguishable from each other. For a — 1.98 
close to the optimal threshold, on the other hand, the average 
BERs for the SW ensembles converge toward a small BER 
more quickly than that for the regular coupling. It is worth 
noting that the (64, 2, 0.1, 2, 14)-SW ensemble outperforms 
the (128,2,0.1,4,28)-SW ensemble. This implies that there 
is no point in using the SW ensembles with c > 2, or that it is 
difficult to find a good instance from such SW ensembles. In 
order to investigate the reasons of the quick convergence, we 
focus on the minimum BERs over all m — 0, . . . , L — 1, shown 
by the lines with pluses, which should be at a training position. 
We find that the minimum BERs for the SW ensembles 
converge more quickly than that for the regular coupling. This 
quick convergence in the training phase reduces the overall 
number of iterations. Another reason is a slight improvement 
of the BP threshold: The BP thresholds for the regular coupling 
and the (64, 2, 0.1, 2, 14)-SW ensemble are approximately 
1.98958 (a « 1.83981) and 1.99911 (a « 1.84617), respec- 
tively. 



V. Conclusions 

The SW ensemble of coupling has been proposed to 
accelerate iterative detection in the training stage for SC 
CDMA systems. Instances picked up from the ensemble have 
direct connections between distant training positions. The DE 
analysis has shown that the proposed method can provide a 
significant reduction in the number of iterations for highly 
loaded systems, compared to the conventional method of 
coupling. We conclude that irregular coupling in the training 
phase can accelerate iterative detection for SC systems. 
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